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Abstract 

Given any fixed integer q > 2, a g-monomial is of the format x'^x"' ■ ■ ■ a;|' such 
that 1 < Sj < q — 1, 1 < j < t. q- monomials are natural generalizations of 
multilinear monomials. Recent research on testing multilinear monomials and 
g-monomails for prime q in multivariate polynomials relies on the property that 
Z q is a field when q > 2 is prime. When q > 2 is not prime, it remains open 
whether the problem of testing g-monomials can be solved in some compati- 
ble complexity. In this paper, we present a randomized (9*(7.15 fc ) algorithm for 
testing g-monomials of degree k that are found in a multivariate polynomial that 
is represented by a tree-like circuit with a polynomial size, thus giving a pos- 
itive, affirming answer to the above question. Our algorithm works regardless 
of the primality of q and improves upon the time complexity of the previously 
known algorithm for testing g-monomials for prime q > 7. 

Keywords: Algebra; complexity; multivariate polynomials; monomials; mono- 
mial testing; randomized algorithms. 



1. Introduction 

1.1. Background 

Recently, significant efforts have been made towards studying the pro blem 
of testing monomials in multivariate polynomials 



waras studying tne pro blem 

Slil&SOlSlIIIIl, 



with the central question consisting of whether a multivariate polynomial rep- 
resented by a circuit (or even simpler structure) has a multilinear (or some 
specific) monomial in its sum-product expansion. This question can be an- 
swered straightforwardly when the input polynomial has been expanded into a 
sum-product representation, but the dilemma, though, is that obtaining such 
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a representation generally requires exponential time. The motivation and ne- 
cessity of studying the monomial testing problem can be clearly understood 
from its connections to various critical problems in computational complexity 
as well as the possibilities of applying algebraic properties of polynomials to 
move forward the research on those critical problems (see, e.g., [10|). 

Historically, polynomials and the studies thereof have, time and again, con- 
tributed to many advancements in theoretical computer science research. Most 
notably, many major breakthroughs in complexity theory would not have been 
possible without the invaluable roles played by low degree polynomial test- 
ing/representing and polynomial identity testing. For example, low degree poly- 
nomial testing was involved in the proof of the PCP Theorem, the cornerstone 
of the theory of computational hardness of approximation and the culmination 
of a long line of research on IP and PCP (see, Arora et al. j3j and Feige et al. 
(l4l|). Polynomial identity testing has been extensively studied due to its role 
in various aspects of theoretical computer science (see, for example, Kabanets 
and Impagliazzo [161 ]) and its applications in various fundamental results such 
as Shamir's IP=PSPACFjp| and the AKS Primality Testing Q. Low degree 
polynomial representing [19( has been sought after in order to prove important 
results in circuit complexity, complexity class separation and sub-exponential 
time learning of Boolean functions (see, for examples, Beigel 0, Fu 15], and 
Klivans and Servedio [13] )• Other breakthroughs in the field of algorithmic de- 
sign have also been achieved by combinations of randomization and algebriza- 
tion. Randomized algebraic techniques have led to the randomized algorithms 
of time 0*(2 k ) for the fc-PATH problem and other problems 3, 22|. Another 
recent seminal example is the improved randomized 0(1.657") time algorithm 
for the Hamiltonian path problem by Bjorklund [fj. This algorithm provided a 
positive answer to the question of whether the Hamiltonian path problem can 
be solved in time 0(c n ) for some constant < c < 2, a challenging problem that 
had been open for half of a century. Bjorklund et al. further extended the above 
randomized algorithm to the A;-path testing problem with 0*(1.657 fc ) time com- 
plexity ML Very recently, those two algorithms were simplified by Abasi and 
Bshouty [l| . These are just a few examples and a survey of related literature is 
beyond the scope of this paper. 



1.2. The Related Work 

The problem of testing multilinear monomials in multivariate polynomials 
was initially exploited by Koutis [3] and then by Williams [22[ to design ran- 
domized parameterized algorithms for the fc-path problem. Koutis 

.18] 

initially 

developed an innovative group algebra approach to testing multilinear monomi- 
als with odd coefficients in the sum-product expansion of any given multivariate 
polynomial. Williams [22[ then further connected the polynomial identity test- 
ing problem to multilinear monomial testing and devised an algorithm that can 
test multilinear monomials with odd or even coefficients. 

The work by Chen et al. 0, EH, Ell EH EH aimed at developing a theory of 
testing monomials in multivariate polynomials in the context of a computational 
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complexity study. The goal was to investigate the various complexity aspects 
of the monomial testing problem and its variants. 

Initially, Chen and Fu [13 proved a series of foundational results, beginning 
with the proof that the multilinear monomial testing problem for nXM polyno- 
mials is NP-hard, even when each factor of the given polynomial has at most 
three product terms and each product term has a degree of at most 2. These 
results have built a base upon which further study of testing monomials can 
continue. 

Subsequently, Chen et al. [HI (see, also, [HI) studied the generalized q- 
monomial testing problem. They proved that when q > 2 is prime, there is 
a randomized 0*(q k ) time algorithm for testing q- monomials of degree k with 
coefficients ^ (mod q) in an arithmetic circuit representation of a multivariate 
polynomial which can then be derandomized into a deterministic 0*((6.4p) fc ) 
time algorithm when the underlying graph of the circuit is a tree. 

In the third paper, Chen and Fu @ (and [Til] ) turned to finding the co- 
efficients of monomials in multivariate polynomials. Naturally, testing for the 
existence of any given monomial in a polynomial can be carried out by com- 
puting the coefficient of that monomial in the sum-product expansion of the 
polynomial. A zero coefficient means that the monomial is not present in the 
polynomial, whereas a nonzero coefficient implies that it is present. Moreover, 
they showed that coefficients of monomials in a polynomial have their own im- 
plications and are closely related to core problems in computational complexity. 



1.3. Contribution and Organization 

Recent research on testing multilinear monomials and g-monomails for prime 
q in multivariate polynomials relies on the property that Z-i and Z q are fields 
only when q > 2 is prime. When q > 2 is n ot p rime, Z„ is no longer a field, 
hence the group algebra based approaches in [13, [22j [l3j, [l2| are not applicable 
to cases of non-prime q. It remains open whether the problem of testing q- 
monomials can be solved in some compatible complexity for non-prime q. Our 
contribution in this paper is a randomized 0*(7.15 fe s 2 (n)) algorithm for testing 
g-monomials of degree k in a multivariate polynomial represented by a tree-like 
circuit of size s(n), thus giving an affirming answer to the above question. Our 
algorithm works for both prime q and non-prime q as well. Additionally, for 
prime q > 7, our algorithm provides us with some substantial improvement 
on the time complexity of the previously known algorithm [l3l . [l2| for testing 
g-monomials. 

The rest of the paper is organized as follows. In Section 2, we introduce 
the necessary notations and definitions. In Section 3, we examine three exam- 
ples to understand the difficulty to transform g-monomial testing to multilinear 
monomial testing. In Section 4, we propose a new method for reconstructing 
a given circuit and a technique to replace each occurrence of a variable with a 
randomized linear sum of q — 1 new variables. We show that, with the desired 
probability, the reconstruction and randomized replacements help transform the 
testing of (/-monomials in any polynomial represented by a tree-like circuit to the 
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testing of multilinear monomial in a new polynomial. We design a randomized 
g-monomial testing algorithm in Section 5 and conclude the paper in Section 6. 

2. Notations and Definitions 

For 1 < ii < ■ ■ ■ < ik < n, n = ■ ■ ■ x\* is called a monomial. The degree 



of 7r, denoted by deg(7r), is ^2 Sj. ir is multilinear, if si = ■ ■ ■ = St = 1, i.e., tt 



is linear in all its variables x^, . . . , X{ t . For any given integer q > 2, ir is called 
a q- monomial if 1 < s\, . . . , s t < q — 1. In particular, a multilinear monomial is 
the same as a 2-monomial. 

An arithmetic circuit, or circuit for short, is a directed acyclic graph consist- 
ing of + gates with unbounded fan-ins, x gates with two fan-ins, and terminal 
nodes that correspond to variables. The size, denoted by s(n), of a circuit with 
n variables is the number of gates in that circuit. A circuit is considered a 
tree-like circuit if the fan-out of every gate is at most one, i.e., the underlying 
directed acyclic graph that excludes all the terminal nodes is a tree. In other 
words, in a tree-like circuit, only the terminal nodes can have more than one 
fan-out (or out-going edge). 

Throughout this paper, the 0*{-) notation is used to suppress poly(n, k) 
factors in time complexity bounds. 

By definition, any polynomial F(xi, . . . , x n ) can be expressed as a sum of a 
list of monomials, called the sum-product expansion. The degree of the polyno- 
mial is the largest degree of its monomials in the expansion. With this expanded 
expression, it is trivial to see whether F{x\, . . . , x n ) has a multilinear monomial, 
or a monomial with any given pattern. Unfortunately, such an expanded expres- 
sion is essentially problematic and infeasible due to the fact that a polynomial 
may often have exponentially many monomials in its sum-product expansion. 

In general, a polynomial F{x\, . . . , x n ) can be represented by a circuit. This 
type of representation is simple and compact and may have a substantially 
smaller size polynomially in n, when compared to the number of all monomi- 
als in its sum-product expansion. Thus, the challenge then is to test whether 
F(xi, . . . , x n ) has a multilinear (or some other desired) monomial efficiently, 
without expanding it into its sum-product representation. 

For any given nx n matrix A, let perm(^l) denote the permanent of A and 
det(^4) the determinant of A. 

For any integer k > 1, we consider the group Z$ with the multiplication 
• defined as follows. For fc-dimensional column vectors x, y £ with x = 
(xi, . . . , x k ) T and y = (y 1 , . . . , y k ) T , x ■ y = (xi + yi, . . . , x k + y k ) T ■ vo = 
(0, . . . , 0) T is the zero element in the group. For any field J 7 , the group algebra 
T\Z^\ is defined as follows. Every element u £ T\Z^\ is a linear addition of the 
form 



u 




(1) 
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For any element v — hxi, we define 

u + v = (a,i+bi)xi, and 

u-v = (aibj){xi-yj). 

a,, bj<=T, and Si, yj£Z% 

For any scalar c G J 7 , 

cu = c = ^2 {ca.i)xi. 

The zero element in the group algebra ^[Z^] is = Ov, where is the zero 

V 

element in T and v is any vector in Z\ . For example, = Ot^o = 0v\ + 0v2 + O1T3, 
for any Vi £ Z\^ 1 < i < 3. The identity element in the group algebra J-[Z%] 
is 1 = l«o = ^o, where 1 is the identity element in T . For any vector v — 
(«!,..., w fc ) T S Z|, for t > 0, let (w) 1 = (u>i, . . . , iv k ) T . When the field T is Z 2 
with respect to (mod 2) operation, for any x, y G Z2, and x + y stands for 
xy (mod 2) and x + y (mod 2), respectively. In particular, in the group algebra 
Z%[Z$], for any z G Z\ we have (v)° = (v) 2 = «o- 



3. q-Monomials, Multilinear Monomials and Plus Gates 

As we pointed out before, group algebra based algorithms 3, 22, 13[ HJ 
cannot be called upon to test g-monomials when q is not prime, because Z q is 
not a field. Hence, in such a case the algebraic foundation for applying those 
algorithms is no longer available. 

It seems quite hopeful that there might be a way to transform the problem 
of testing q-monomials into the problem of testing multilinear monomials and 
thus utilize the existing techniques for the latter problem to solve the former 
problem. One plausible strategy to accomplish such a transformation is to 
replace each variable a; in a given multivariate polynomial by a sum y\ + j/2 + 
• • • + of q — 1 new variables. Ideally, such replacements should result in 
a multilinear monomial in the new polynomial that corresponds to the given 
^-monomial in the originalx polynomial and vice versa, thereby allowing the 
multilinear monomial testing algorithm based on some group algebra over a field 



of characteristic 2 (18l.l22| to be adopted for the testing of multilinear monomials 
in the new polynomial. Unfortunately, some careful analysis will reveal that 
this approach has, as exhibited in Example [I] a profound technical barrier that 
prevents us from applying those mulilinear monomial testing algorithms. 
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Example 1. Consider a simple 4- monomial ir — x 3 of degree 3. Replacing x 
with yi + ?/2 + J/3 in 7r results in 

K 71 ") = (yi + 2/2 + 2/3) 3 

= 2/i + 2/2 + 2/1 + 3 2/i2/2 + 3yiy| + 3y%y 3 + 3y 2 yf + 3j/iy 3 + Zyxyl 
+ 6yi2/22/3- 

r(7r) has one and only one degree 3 multilinear monomial n' = yxyiyz- It is 
unfortunate that the coefficient c(tt') of n' is 6, an even number. When applying 
the group algebra based multilinear monomial testing algorithms to r(ir) over 
the field Z2 with respect to (mod 2) operation, the even coefficient c(ir') will 
help eliminate w' from r(ir). Hence, we are unable to find the existence of any 
multilinear monomials in the sum-product expansion of r(7r). 

Knowing that the above example can be generalized to arbitrary g-monomials 
for q > 2, we have to design an innovative replacement technique so that certain 
multilinear monomials in the new polynomial will survive the elimination by the 
(mod 2) operation over Z2, or by the characteristic 2 property over any field 
of characteristic 2. Specifically, we have to ensure, with complete or desired 
probabilistic certainty, that a given q-monomial it with coefficient c(tt) in the 
original polynomial will correspond to one or a list of "distinguishable" multi- 
linear monomials with odd coefficients in the derived polynomial, regardless of 
the parity of c(n). 




Fig. 1. A Circuit for ir = x 3 Fig. 2. The Expanded Circuit for tt = x 3 

When group algebraic elements are selected to replace variables in the input 
polynomial, the polynomial might become zero due to mutual annihilation of 
the results from a list of multilinear monomials with odd coefficients. Koutis 
(l8j proved that when those group algebraic elements are uniform random, with 
a probability at least j, the input polynomial that has multilinear monomials 
with odd coefficients will not become zero, even if mutual annihilation of the 
results from a list of multilinear monomials with odd coefficients may happen. 
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Williams 22J introduced a new variable for each x gate in the representa- 
tive circuit for the input polynomial that can help avoid the aforementioned 
mutual annihilation. In essence, the new variables added for the x gates can 
help generate one or a list of " distinguishable" multilinear monomials with odd 
coefficients in the derived polynomial, no matter whether the coefficient of the 
original multilinear monomial is even or odd. However, this approach cannot 
help resolve the g-monomial testing problem, due to possible implications of + 
gates. 

In order to understand the above situation, let us examine Example 1 again. 
Following Williams's algorithm, we first reconstruct the circuit in Figure 1. 
The expanded circuit, after the replacement of x by y\ + j/2 + 2/3 along with the 
addition of new variables z\ and z 2 for the two respective + gates, is shown in 
Figure 2. The coefficient for the only multilinear monomial y\y 2 y^ produced by 
the new circuit is 6ziz 2 , which is even and thus helps annihilate 2/1J/2J/3 with 
respect to (mod 2) operation or in general the characteristic 2 property of the 
underlying field. 

The following two examples provide us with more evidences that there are 
technical difficulties in dealing with possible implications of + gates. 




Fig. 3. A Circuit for F(x\,X2) Fig. 4. The Reconstructed Circuit 

for F(xi,x 2 ) 

Example 2. Let F(xi, X2) — 2xfx 2 +2x 2 as represented by the circuit in Figure 
3. F has one 5-monomial tt\ — x\x 2 and one 3-monomial tt 2 — x\ : each of which 
has a coefficient 2. 

When one follows the approach by Williams [22J to add, for each x gate 
in Figure 3, a new x gate that multiplies the output of this gate with a new 
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variable, then one obtains a new circuit in Figure 4 that computes 

F'(z ll z 2 , . . . , z 7 , Xi,x 2 ) — z 1 z 3 z 5 z 7 x^x 2 + Z3Z±z & z 7 x\x2 + 2z 7 x\. 

Although 2x\x 2 in F is spilt into two distinguishable occurrences that have 
respective unique coefficients z\z 3 z^z 7 and z^z^z^z-j, yet 2x^ in F corresponds 
to 2z 7 x\ that has an even coefficient 2z 7 . 

In particular, the implications of + gates on testing multilinear monomials 
can be seen from the following example. 

Example 3. Let G(xi,X2,%a) — 2x\x 3 + 2x 2 x 3 . Changing the terminal node 
X2 to X3 for the top x gate in Figure 3 (respectively, for the top second x gate 
in Figure 4) gives a circuit to compute G (respectively, G'). 

Like in Example 2, G'(zi, z 2 , £3, xi, x 2 , 23) = Z\z^z^z 7 x\x% + z 3 z^z^z 7 x\x 3 + 
2z 7 x 2 x 3 - Here, 2x\x 3 is spilt into two distinguishable occurrences that have 
unique coefficients z\Zzz$z 7 and z^z^zqz-j, respectively. However, the only mul- 
tilinear monomial 2x 2 x 3 in G corresponds to 2z 7 x 2 x 3 that has an even coefficient 
2z 7 . Therefore, this multilinear monomial cannot be detected by Williams' al- 
gorithm. 

Example 3 exhibits that there is a flaw in the circuit reconstruction by 
Williams [22j: Introducing a new variable to multiply the output of every x 
gate is not sufficient to overcome the difficulty that may possibly be caused by 
+ gates. 

4. Circuit Reconstruction and A Transformation 

In this section, we shall design a new method to reconstruct a given circuit 
and a randomized variable replacement technique so that we can transform, 
with some desired success probability, the testing of g-monomials to the testing 
of multilinear monomials. 

To simplify presentation, we assume from now on through the rest of the 
paper that if any given polynomial has g-monomials in its sum-product expan- 
sion, then the degrees of those multilinear monomials are at least k and one of 
them has exactly a degree of k. This assumption is feasible, because when a 
polynomial has g-monomials of degree < k, e.g., the least degree of those is £ 
with 1 < £ < k, then we can multiply the polynomial by a list of k — £ new 
variables so that the resulting polynomial will have g-monomials with degrees 
satisfying the aforementioned assumption. 

4-1- Circuit Reconstruction 

For any given polynomial F{x\, x 2 , . . . , x n ) represented by a tree-like circuit 
C of size s(n), we first reconstruct the circuit C in three steps as follows. 

Eliminating redundant + gates. Starting with the root gate, check to 
see whether a + gate receives input from another + gate. If a + gate g receives 
input from a + gate /, which receives inputs from gates /1, f 2 , . . . , f s and/or 
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terminal nodes U\ 1 U2, ■ ■ ■ , u t , then delete / and let the gate g to receive inputs 
directly from fi, f2, ■ ■ ■ , fs and/or iti, it2, • ■ • , Ut- Repeat this process until there 
are no more + gates receiving input from another + gate. 

Note that we consider tree-like circuits only. Since each gate of such a circuit 
has at most one output, the above eliminating process will not increase the size 
of the circuit. 

Duplicating terminal nodes. For each variable Xi, if xi is the input to a 
list of gates g\, g2, ■ ■ ■ , g£, then create I terminal nodes u±,U2, ■ ■ ■ ,ue such that 
each of them represents a copy of the variable Xi and gj receives input from Uj , 
±<3<L 

Let C* denote the reconstructed circuit after the above two reconstruction 
steps. Since the original circuit C is tree-like, the underlying graph of C* , includ- 
ing all the terminal nodes, is a tree. Such a tree structure implies the following 
simple facts: 

• There is no duplicated occurrence of any input variable along any path 
from the root to a terminal node. 

• Every occurrence of each variable Xi in the sum-product expansion of F 
is represented by a terminal node for Xi. 

• The size of the new circuit is at most ns(n). 

• Any + gate will receive input from x gates and/or terminal nodes. 




Z1 



Z2 



2^ 




iti ii 



X1 



Fig. 5. The New Circuit for F{x 1 ,x 2 ) = 2x\x 2 + 2x\ 

Adding new variables for x gates and for those terminal nodes 
that directly connect to + gates. Having completed the reconstruction for 
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C*, we then expand it to a new circuit C as follows. For each x gate gt in C*, 
we attach a new x gate g[ that multiplies the output of gi with a new variable 
Zj, and feed the output of g[ to the gate that reads the output of g^. Here, the 
way of introducing new variables for x gates follows what is done by Williams 
in |22j |. However, in addition to these new z- variables, we may need to introduce 
additional variables for + gates. Specifically, for each + gate / that receives 
inputs from terminal nodes Ui,tt2, • • • ,Ut, we add a x gate fj and have it to 
receive inputs from Uj and a new variable Zj and then feed its output to /, 
1 < j < t. Note that / may receive input from x gates but no new gates are 
needed for those gates with respect to /. 

Assume that a list of h new z-variables z%, z 2 , . . . , z% have been introduced 
into the circuit C Let F'(zi, Z2, ■ ■ • , Zh, Xi, X2, ■ • ■ , x n ) be the new polynomial 
represented by C. 

In Figure 5, we show the reconstructed circuit for the one in Figure 3 that 
represents F(xi, X2) — 2x\x 2 + 2x\. By this new circuit, 

F'(z\,Z2, ■ ■ ■ , zq, xi, X2) = ziz 2 z b z 7 x\x 2 + z 3 Z4z e z7xjx 2 + z 7 z s xl + Z-jZgx\. 

As expected, not only is 2x\x 2 in F split into two distinguishable occurrences 
that have unique coefficients Z1Z2Z5Z7 and Z3Z4Z6Z7, but also 2x 2 in F is split 
into two distinguishable occurrences that have unique coefficients zjzg, and z-jZg. 
Notably, those four coefficients are multilinear monomials of z-variables and each 
has an odd scalar coefficient 1. 

Lemma 1. F{x\ 1 x 2l . . . ,x n ) has a monomial ir of degree k in its sum-product 
expansion if and only if there is a monomial an in the sum-product expansion 
of F'(zi, z 2 , . . . , Zh, X\, x 2 , . . . , x n ) such that a is a multilinear monomial of z- 
variables with degree < 2k — 1. Furthermore, if F' has two products a\ir and 
a 2 n in its sum-product expansion, then we have ot\ ^ a 2 , where ot\ and a 2 are 
products of z-variables; and any two different monomials of x-variables in F' 
will have different coefficients that are products of z-variables. 

Proof. By the reconstruction processes, C* computes exactly the same poly- 
nomial F. If F has a monomial it of degree k, then let T be the subtree of C* 
that generates the monomial tt, and T' be the corresponding subtree of T in C 
By the way the new z-variables are introduced, the monomial generated by T' 
is air with a as the product of all the z-variables added to T to yield T'. Since 
7r has degree k, T has k — 1 many x gates. So, T 1 has k — 1 new x gates along 
with k — 1 many new z-variables that are added with respect to those x gates 
in T. In addition, T' has k terminal nodes representing k individual copies of 
x-variables in it. When such a terminal node is connected to a + gate, then a 
new x gate is added along with a new z-variable. Thus, the terminal nodes in 
T' can contribute at most k additional z-variables. Therefore, the degree of a 
is at most 2k — 1. Since all those z-variables are distinct, a is multilinear. 

If F' has a monomial air such that a is a product of z-variables and ir is 
a product of x-variables, then let M! be the subtree of C that generates cot. 
According to the construction of C* and C, removing all the z-variables along 
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with the newly added x gates from M! will result in a subtree A4 of C* that 
generates ir. Thereby, ir is a monomial in F. 

Assume that F' has ctiir and a 2 ir in its sum-product expansion, where a\ 
and a2 are products of z-variables. Let T{ and be the two subtrees in C 
that generate ai7r and a,2ir, respectively. Since each of such subtrees in C' can 
be used once to generate one product in the sum-product expansion of F', we 
have T{ ^ Let 7i and T2 be the two respective subtrees of T{ and TJ in 
C*. By the ways of circuit reconstruction and introduction of new z-variables, 
T\ 7^ T-i implies T\ ^ Ti- Note that T\ and T2 generates the same ir. There are 
two cases for T\ and T2 to differ: either 7i and T2 differ at a x gate g, or they 
have the same x gates but differ at a terminal node u. In the former case, the 
z-variables added with respect to g will make a\ and a 2 different. In the latter 
case, we assume without loss of generality that 71 has a terminal node u but 
T2 does not. In this case, the parent node u' of u has to be a + gate. Hence, a 
new z-variable is added for the new x gate between u' and u. Therefore, this 
new z-variable makes u\ and a.2 different. 

Now, consider that F' has two monomials air and /3<j> such that, ir and <j> 
are products of x-variables and a and (3 are products of z-variables. Let 
and W 2 be the subtrees in C that generate air and /30, respectively. Again, 
according to the construction of C* and C, removing all the z-variables along 
with the newly added x gates from H'i and H' 2 will result in two subtrees 
"Hi and %2 of C* that generate ir and (f>, respectively. When ir ^ (f>, Hi and 
H2 are different subtrees. Following a similar analysis in the above paragraph 
for 71 and T2 to be different, we have a ^ (3. Also, since the z- variables in a 
corresponds to x gates in H'i that do not repeat themselves because "H'i is a 
tree, a is multilinear. Similarly, j3 is also multilinear. 

Combining the above analysis completes the proof for the lemma. 

4-2. A Transformation 

In order to present the technique to transform the testing of g-monomials to 
the testing of multilinear monomials, we introduce one more definition related 
to variable replacements. 

Definition 1. Let q > 2 be a fixed integer. Let ir = x s for 1 < s < q — 1. 
Consider 



where Cij are constants and yj are new variables, 1 < i < s and 1 < j < q — 1. 
For 1 < s < q — 1, let ir' = j/ij/2 • • • Us- Define the coefficient matrix of ir' with 
respect to r(ir) as 



s 



r ( 7F ) = Il^ Ca2/1 + Ci2V2 ^ h C i(q-l)Vq-l) 



C[ir',r(ir)] = 



( Cn C12 

C21 c 2 2 




\ C s i c s2 
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Transformation: For any given n-variate polynomial F(xi,X2, • ■ • , x n ) rep- 
resented by a circuit C, we first carry out the circuit reconstruction as addressed 
in Subsection l4.1l to obtain a new circuit C and let F'(zi , z-x , . . . , z^, x\, X2, ■ ■ ■ , x n ) 
be the new polynomial represented by C. The transformation through replacing 
x-variables works as follows: For each variable Xi and for each terminal node 
Uj representing Xi in circuit C , select uniform random values c%jt from Z 2 and 
replace x\ at the node Uj with 

r(xi) = (cijiyn +c ij2 yi2 H Cij{q-i)Vi{ q -i))- (2) 

Let 

G(z%,. . . , z h ,yn, . . . , 1) , • ■ • , VnXi ■ ■ ■ > Vn(q-i)) 

be the polynomial resulted from the above replacements for circuit C . 

We need Lemmas [2] and [3] in the following to help estimate the success 
probability of the transformation. 

Consider the vector space Z^, For any vector vi, V2, . ■ ■ , Vk S Z^, 1 < k < n, 
let span(«i, v 2 , ■ ■ ■ , Wfe) denote the linear space generated by those k vectors. The 
following lemma follows directly from Lemma 6.3.1 of Blum and Kannan in Q. 

Lemma 2. [8] Assume that t?i, #2, . . . , Vk are random vectors uniformly chosen 
from Z2, 1 < k < n and n > 1. Let Pr[v*i, V2, ■ ■ ■ , Vk] denote the probability that 
vi , V2 , ■ ■ ■ , Vk are linearly independent. We have 

Pr\v x ,V2,...,v k ] >0.28. 

Koutis had a proof for Pr[«i, V2, . . . , Vk] > 2; which is contained in the proof 



for his Theorem 2.4 18]. But some careful examination will show that there is a 



flaw in the analysis for k = 3. Nevertheless, we present a proof in the following. 

Proof. From the basis of linear algebra, we know that span(«i, V2, . ■ . , Vk) has 
2 k vectors and any vector in Z^ — span(«i, U2, . . . , Vk) is linearly independent of 
vi,V2,...,v k . Note that \Z%\ = 2". Therefore, 

Pr[vi,v 2 ,...,v k ] = Pr[v k g span(« l5 v 2 , . . . , v k -i)} • Pr[vi, v 2 , . . . , v k -i] 

= i 1 - 2 n-fc+l )- Pr ^1^2,---,^fc-l] 
k 



The last inequality holds because of 1 < k < n. For any 1 < k < 40, by simply 
carrying out the computation for the right product of expression ([3]) , we obtain 

k 

- > 0.288788 > 0.28, 1 < k < 40, and (4) 

i=l 
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40 \ 

Ila-y) 

i=i / 



— > 0.281444 > 0.28. (5) 
41 - w 



It is obvious that 2 l > i 2 for i > 41. Combining this with expressions ([3]) and 
(5) yields, for any fe > 40, 



IK 1 -?) - iid -?) • nc 1 -?) 

i=l \ i=l / \ i=41 



2 l , , 

i=l / \ i=41 



»=1 / V i=41 



40 



1=1 

jfe + 1 



2 l/ / 41 & 



> 0.28- 

> 0.28 (6) 
The complete proof is then derived from expressions Q and ([6]). 

Lemma 3. For any integer matrix A — {a,ij) n xn> we have 

perm(A) (mod 2) = det(A) (mod 2). (7) 

Proof. Let A be any permutation of {1, 2, . . . , n}, and sign(A) be the sign of 
the permutation A. Since for any integer b, b = —b (mod 2), we have 

det{A) (mod 2) = ( ^(-l) sign(A) aiA(i)a 2 A(2) ■ ■ ■ a nX(n) ^ (mod 2) 

= ( ^2 a iA(i) a 2A(2) ' ' ' a n \(n) j (mod 2) 

= perm(yl) (mod 2). 

It is obvious that the above lemma can be easily extended to any field of 
characteristic 2. We are now ready to estimate the success probability of the 
transformation. 



Lemma 4. Assume that the variable replacements are carried out over a field T 
of characteristic 2 (e.g., If a given n-variate polynomial F(x\, X2, ■ ■ ■ , x n ) 

that is represented by a tree-like circuit C has a q-monomial of x -variables with 
degree k, then, with a probability at least 0.28 , G has a unique multilinear 
monomial an such that it is a degree k multilinear monomial of y-variables 
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and a is a multilinear monomial of z -variables with degree < 2k — 1. If F has 
no q-monomials, then G has no multilinear monomials of y -variables, i.e., G 
has no monomials of the format /3(f) such that /3 is a multilinear monomial of 
z-variables and (f> is a multilinear monomial of y -variables. 

Proof. We first show the second part of the lemma, i.e., if F has no q- 
monomials, then G has no multilinear monomials of y-variables. Suppose other- 
wise that G has a multilinear monomial /3<f>. Let cj> = (f>i(f>2 • • • 4> s such that <f)j is 
the product of all the y- variables in <j> that are used to replace the variable Xi , 
and let deg(^j) = dj, 1 < j < s. Consider the subtree T' of C that generates 
/3(f) when the a;- variables are replaced by a linear sum of y- variables according to 
expression ([2]). Then, the subtree T in C* that corresponds to T' in C computes 
a monomial 7r = x^x^ ■ ■ ■ xf* and is a multilinear monomial in the expan- 
sion of the replacement r(ir), which is obtained by replacing each occurrence of 
a;- variable with a linear sum of (q — 1) many y- variables by expression @. If 

there is one dj such that dj > q, then let us look at the replacements for xf J , 
denoted as 



r ( X h ) = II( Ctiyi + Ct2V2 + ■■■+ C t(q-l)V(q-l))- 
t=l 



Since dj > q, by the pigeon hole principle, the expansion of the above rix^ 1 ) 
has no multilinear monomials. Thereby, we must have 1 < dj < q — 1, 1 < j < 
s. Hence, tt is a q-monomial in F, & contradiction to our assumption at the 
beginning. Therefore, when F has no g-monomials, then G must not have any 
multilinear monomials of y- variables. 

We now prove the first part of the lemma. Suppose F has a (/-monomial 
7r = x^x^ ■ ■ ■ x^ with 1 < Sj < q — 1, 1 < j < t, and k — deg(7r). By Lemma 
[TJ F 1 has at least one monomial corresponding to n. Moreover, each of such 
monomials has a format aix such that a is a unique multilinear monomials of z- 
variables with deg(a) < 2k — 1. Let /3 = air be one of such monomials. Consider 
the subtree T' of C that generates (3. Based on the construction of C, T' has 
Sj terminal nodes representing Sj occurrences of Xi j in tt, 1 < j < t. By variable 
replacements in expression ([5]), /3 becomes r(/3) as follows: 



r((3) = ar{Tx) 
t 

n 



a 
1=1 



~\\{cijiyn + ciiiyii H V Cij{q-\)Vt( q ~\)) 



(8) 



where each occurrence j of Xi t is replaced by (cijiyn+cij2yi2+' • ■+ c ij(q~i)Vi(q-i)) 
For 1 < I < t, let ir t = x\\ , and 

Si 

r(ne) = JJ(%-iya + C£j 2 yi2 H (9) 
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Since 1 < s/i < q — 1, by expression ([3]), r(7r^) has a multilinear monomial ir' e 
with cocmcient ct such that 

^ = yayn---yis t , and (10) 

q = perm(C[7^,r(7^)]), (11) 
where the coefficient matrix, as defined in Definition [TJ is 





( cni 


cei2 ■ 


Cl\s t \ 




cm 


Q22 


C£2s e 




\ C£s e l 


Cls e 2 • 


Cisisi J 



Since the field T has characteristic 2 and all the entries in the coefficient are 
0/1 values, we have by Lemma [3] 

perm(C[7^,r(7rf)]) = det(C[7^, r(ne)]). 

Because each row of C[7r^,r(7r^)] is a uniform random vector in Z^ 1 , by Lemma 
[21 with a probability of at least 0.28, those row vectors are linearly independent, 
implying det(C[7r^, r(ng)]) = 1. Hence, by expressions (j9|), (fT0|) and (fTTj) . with 
a probability at least 0.28, r(iri) has a multilinear monomial ir' e . By expression 
((8]), with a probability at least 0.28* > 0.28 fc , ar(7r) has a desired multilinear 
monomial ock'-^k 1 ^ • ■ -7Tj. 

5. Randomized Testing of q-monomials 

Let d = log 2 (2fc — 1) + 1 and J 7 = GF(2 d ) be a finite field of 2 d many elements. 
We consider the group algebra T\Z\\. Please note that the field T — GF(2 d ) has 
characteristic 2. This implies that, for any given element w € J 7 , adding w for 
any even number of times yields 0. For example, w + w = 2w = w + w + w + w = 
4w = 0. 

The algorithm RandQMT for testing whether any given n-variate polynomial 
F(xx, X2, . . . , x n ) that is presented by a tree-like circuit C has a (/-monomial of 
degree k is given in the following. 

Algorithm RandQMT ( Ran domized q-Monomials Testing): 

1. As described in Subsection l4.11 reconstruct the circuit C to obtain 

C* that computes the same polynomial F{x\,X2, ■ ■ ■ ,x n ) and 
then introduce new z-variables to C* to obtain the new circuit 
C that computes F'(zi, z%, . . . , Zh, X\,X2, ■ ■ ■ , x n ). 

2. Repeat the following loop for at most ((f5g) fc times. 
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2.1. For each variable Xj and for each terminal node Uj repre- 
senting Xi in circuit C, select uniform random values Cijg 
from Z-i and replace Xi at the node Uj with 

r(xi) = {cijiyn + Cij2Vi2 H l-Cjj(9-i)^(9-i))- ( 12 ) 

Let 

G(zi, . . . ,Zh,Vll, • • • , , ■ • • ij/nl, • • • i2/n(g-l)) 

be the polynomial resulted from the above replacements 
for circuit C 

2.2. Select uniform random vectors Vij £ Z% — {vq}, and re- 
place the variable with [vij + vq), 1 < i < n and 

i < i < « - 1- 

2.3. Use C to calculate 

G' = G(z 1 ,...,z h ,(v 11 +v ),...,(v liq _ 1) +v ),..., 

(Vnl +V ),..., (v n ( q -l) + Vo)) 

= ^2fj(zi,---,Zh)-Vj, (13) 
j=i 

where each fj is a polynomial of degree < 2k — 1 over the 
finite field I = GF(2 d ), and ^ with 1 < j < 2 fe are the 2 fe 
distinct vectors in Z\ . 

2.4. Perform polynomial identity testing with the Schwartz- 
Zippel algorithm [2(| for every fj over T. Return "yes" if 
one of those polynomials is not identical to zero. 

3. Return "no" if no "yes" has been returned in the loop. 

It should be pointed out that the actual implementation of Step 2.3 would 
be running the Schwartz-Zippel algorithm concurrently for all fj, 1 < j < 2 k , 
utilizing the circuit C. If one of those polynomials is not identical to zero, then 
the output of G' as computed by circuit C is not zero. 

The group algebra technique established by Koutis [18| assures the following 
two properties: 

Lemma 5. (fl&]) Replacing all the variables yij in G with group algebraic 
elements Vij + Vq will make all monomials air in G become zero, if n is non- 
multilinear with respect to y-variables. Here, a is a product of z -variables. 

Proof. Recall that T has charateristic 2. For any v € Z% , in the group algebra 

(v + V ) 2 = V ■ V + 2 • V ■ Vq + Vq ■ V 

= ?7 + 2 • v + v 

= 2-v + 2-v = 0. (14) 
Thus, the lemma follows directly from expression ([T4| . 
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Lemma 6. (flaj) Replacing all the variables yij in G with group algebraic 
elements v*ij + Vq will make any monomial air to become zero, if and only if the 
vectors are linearly dependent in the vector space Z\. Here, ir is a multilinear 
monomial of y-variables and a is a product of z-variables. Moreover, when it 
becomes non-zero after the replacements, it will become the sum of all the vectors 
in the linear space spanned by those vectors. 

Proof. The analysis below gives a proof for this lemma. Suppose V is a set of 
linearly dependent vectors in Z\ . Then, there exists a nonempty subset T C V 
such that # = «b- For any SCT, since fl^T v = (UveS $ ) ' (HveT-S & )> 

we have Ilses ^ ~ ITueT-s ^ Thereby, we have 

ver serves 

since every Hcres v IS paired by the same J^ i7gT _ i g v in the sum above and the 
addition of the pair is annihilated because T has characteristic 2. Therefore, 



v£V 




Now consider that vectors in V are linearly independent. For any two distinct 
subsets S, T C V, we must have IliTeT & ^ IlijGS ^ because otherwise vectors in 
S U T — (S (~l T) are linearly dependent, implying that vectors in V are linearly 
dependent. Therefore, 

n (*+*>) = En* 7 

is the sum of all the 2' y ' distinct vectors spanned by V. 

Theorem 1. Let q > 2 be any fixed integer and F(x±, X2, ■ ■ ■ , x n ) be an n- 
variate polynomial represented by a tree-like circuit C of size s(n). Then the 
randomized algorithm RandQMT can decide whether F has a q-monomial of 
degree k in its sum-product expansion in time O* (7 .15 k s 2 (n)) . 

For applications, we often require that the size of a given circuit is a poly- 
nomial in n. in such cases, the upper bound in the theorem becomes 0*(7.15 k ). 

Proof. From the introduction of the new z-variables to the circuit C , it is easy 
to see that every monomial in F' has the format an, where it is a product of 
a;- variables and a is a product of z-variables. Since only a;- variables are replaced 
by respective linear sums of new y-variables as specified in expression (|12[) (or 
expression |2])), monomials in G have the format /3<f>, where is a product of 
y-variables and (3 is a product of z-variables. 
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Suppose that F has no q-monomials. By Lemma 0] G has no monomials 
/3cj> such that is a multilinear monomial of y-variables and j3 is a product of 
z- variables. In other words, for every monomial in G, the y- variable product 
4> must not be multilinear. Moreover, by Lemma El replacing y-variables will 
make cj> in every monomial j3<p in G to become zero. Hence, the replacements 
will make G to become zero and so the algorithm RandQMT will return "no". 

Assume that F has a g-monomial of degree k. By Lemma[4l with a probabil- 
ity at least 0.28 fc , G has a monomial f3<f> such that is a y- variable multilinear 
monomial of degree k and j3 is a z-variable multilinear monomial of degree 

< 2k — 1. It follows from Lemma El a list of uniform vectors from Z% will be 
linearly independent with a probability at least 0.28. By Lemma[51 with a prob- 
ability at least 0.28, the multilinear monomial <j> will not be annihilated by the 
group algebra replacements at Steps 2.2 and 2.3. Precisely, with a probability 
at least 0.28, [3<f) will become 

2 fc 

xm = Y,pv r , (is) 

i=l 

where Vi are distinct vectors in Z%- 

Let S be the set of all those multilinear monomials j3<f> that survive the group 
algebra replacements for y-variables in G. Then, 

G' = G(zx,...,z h , (vn +wo),...,(«i(g_i) +tf ), 

(Vnl +V ),..., («n(<?-l) + «b)) 

= E A (^) 

!3<t>eS \i=l ) 

= E ( E ] *i ( 16 ) 

Let 

fj{z u ...,Zh) = E /3 - 

By Lemmas 2] and [5l the degree of /3 is at most 2fc — 1. Hence, the coefficient 
polynomial /j with respect to Vj in G' after the algebra replacements has degree 

< 2k — 1. Also, by Lemma IH /3 is unique with respect to every for each 
monomial /3</> in G. Thus, the possibility of a "zero-sum" of coefficients from 
different surviving monomials is completely avoided during the construction of 
fj. Therefore, conditioned on that S is not empty, F' must not be identical to 
zero, i.e., there exists at least one fj that is not identical to zero. At Step 2.4, 
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we use the randomized algorithm by Schwartz-Zippel |20( to test whether fj is 
identical to zero. It is known that this testing can be done with a probability 
at least = | in time polynomially in s(n) and log 2 J 7 ! = 1 + log 2 (2fc — 1). 
Since S is not empty with a probability at least 0.28, the success probability of 
testing whether G has a degree k multilinear monomial is at least 0.28 X | > |, 
under the condition that G has at least one degree k multilinear monomial. 

Summarizing the above analysis, when F has a g-monomial of degree k with 
a probability at least 0.28 fe , G has a degree k multilinear monomial <j) of y- 
variables in the format j3<j> with coefficient /3 that is a multilinear monomial of 
z-variables with degree < 2k — 1. Thus, the probability that G does not have 
any degree k multilinear monomials of y- variables in the aforementioned format 
(3(f> in its sum- product expansion during any of the (n^g) loop iterations is at 
most 

(1 - (0.28) k ) { ™ )k < -. 
v ' e 

This implies that the probability that G has at least one degree k multilinear 
monomial during at least one of the (q^s) loop iterations is at least 

1 

1 - -. 

e 

When G has at least one degree k multilinear monomial <f> of y-varaibles in the 
format /?</> as described above, the group algebra replacement technique and 
the Schwartz-Zippel polynomial identity testing algorithm as analyzed above 
will detect this with a probability at least g. Therefore, when F has one q- 
monomial in its sum-product expansion, with a probability at least 

1 



algorithm RandQMT will detect this. 

Finally, we address the issues about how to calculate G' and the time needed 
to do so. Naturally, every element in the group algebra T\Z^\ can be rep- 
resented by a vector in Z\ . Adding two elements in T\Z\\ is equivalent to 
adding the two corresponding vectors in Z\ , and the latter can be done in 
0(2 k ) time via component-wise sum. In addition, multiplying two elements in 
J-\Z^\ is equivalent to multiplying the two corresponding vectors in Z\ , and 
the latter can be done in 0(k2 k log 2 \J-\) = 0(k 2 2 k ) with the help of a simi- 
lar Fast Fourier Transform style algorithm as in Williams [22^ . Calculating G' 
consists of n * s 2 (n) arithmetic operations of either adding or multiplying two 
elements in T\Z\\ based on the circuit C' . Hence, the total time needed is 
0(n*s 2 {n)k 2 2 k ) = 0*{2 k s 2 (n)). At Step 2.4, we run the Schwartz-Zippel algo- 
rithm on G' to simultaneously test whether there is one fj such that fj is not 
identical to zero. The total time for the entire algorithm is 0*(2 k s 2 (n) ■ (o^g) k )- 
Since 

2x^2x^<7.15, 
0.28 28 

the time complexity of algorithm RandQMT is bounded by O* (7 .15 k s 2 (n)) . 
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6. Concluding Remarks 



The group algebra ap pro aches to testing multilinear monomials 18, 22] and 
g-monomials for prime q HI, T3| rely on the property that Zi and Z q are fields 
for primes q > 2. These approaches are not applicable to the general case of 
testing g-monomials, since Z q is no longer a field when q is not prime. In this 
paper, we have developed a variable replacement technique and a new way to 
reconstruct a given circuit. When the two are combined, they help us transform 
the g-monomial testing problem to the multilinear monomial testing problem 
in a randomized setting. We have also proved that the transformation has the 
desired success probability to warrant its application to the design of our new 
algorithm. 

It should be pointed out that the time complexity of the randomized q- 
monomial testing algorithm obtained in [101 ] runs in time 0*(q k ) for prime q > 2, 
when the size of the circuit is a polynomial in n. Algorithm RandQMT runs 
in time 0*(7.15 fe ), hence it significantly improves the time complexity of the 
algorithm in fioj for prime q > 7. 
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